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We calculate the thermodynamic entropy of the mean-field 4> 4 spin model in the microcanonical 
ensemble as a function of the energy and magnetization of the model. The entropy and its derivative 
are obtained from the theory of large deviations, as well as from Rugh's microcanonical formalism, 
which is implemented by computing averages of suitable observables in microcanonical molecular 
dynamics simulations. Our main finding is that the entropy is a concave function of the energy for all 
values of the magnetization, but is nonconcave as a function of the magnetization for some values of 
the energy. This last property implies that the magnetic susceptibility of the model can be negative 
when calculated microcanonically for fixed values of the energy and magnetization. This provides a 
magnetization analog of negative heat capacities, which are well-known to be associated in general 
with the nonequivalence of the microcanonical and canonical ensembles. Here, the two ensembles 
that are nonequivalent are the microcanonical ensemble in which the energy and magnetization are 
held fixed and the canonical ensemble in which the energy and magnetization are fixed only on 
average by fixing the temperature and magnetic field. 

PACS numbers: 05.20.-y, 05.20.Gg, 05.70.Fh 



I. INTRODUCTION 

The Legendre transform connecting the entropy function of the microcanonical ensemble and the free energy of the 
canonical ensemble in the thermodynamic limit of these ensembles is often used in practice to obtain the entropy of 
a system from the knowledge of its free energy. Unfortunately, as has been stressed repeatedly in studies of systems 
involving long-range interactions [l], 0, 0, 0, H, Q , the entropy is the Legendre transform of the free energy only if 
the entropy is concave as a function of the energy. If the entropy is a nonconcave function of the energy, as it often 
happens in long-range systems, then it cannot be calculated as the Legendre transform of the free energy. In this 
case, one must resort to analytically obtain the entropy by other means, e.g., by evaluating directly the density of 
states from which the entropy is defined (see, e.g., Refs. 0, H|j), or by using large deviation techniques based on the 
microcanonical ensemble [il. Is. [ToL ITT1. [l2| . Another possibility put forward recently works by modifying the definition 
of the canonical ensemble in such a way that nonconcave entropies can be obtained from the Legendre transform of a 
modified form of free energy [l3l . [lij . Examples of applications of this generalized canonical ensemble can be in found 
in Refs. [TIGl. 

There are many types of statistical models whose entropy is known to be a nonconcave function of the energy. 
Examples, listed in the order in which they were discovered, include systems of particles interacting through gravi- 
tational forces [Tel , Il9l . l20j , a model of plasma [2lJ , statistical models of two-dimensional turbulence (22|, [23| , as 
well as several spin models involving long-range and mean- field interactions @, H, H, OH • Our goal in this paper is to 
study yet another model, which departs somewhat from the models listed above in that its microcanonical entropy 
is nonconcave as a function of its energy and magnetization. This, as we shall see, has many consequences for the 
equivalence of the many different ensembles that can be conceived for this model according to whether the energy 



'Electronic address: alessandro.campa@iss.infn.it 
t Electronic address: stcfano.ruffo@unifi.it 
^Electronic address: ht@maths.qmul.ac.uk 



Typeset by REVTgX 



2 



and magnetization are treated in a canonical or microcanonical way, i.e., whether these quantities are assumed to 
fluctuate or not. Parallels between these cases of nonequivalent ensembles and those studied in the context of the 
energy alone will be discussed. In particular, we shall see that one consequence of the nonconcavity of the entropy 
with the added magnetization is that the magnetic susceptibility of the model, calculated microcanonically, can be 
negative. The parallel for systems having nonconcave entropies as a function of the energy is that the heat capacity 
can be negative 0, [3, Q3, M, HI • 

The possibility for the entropy to be nonconcave in a variable other than the energy or in more than one variable 
is discussed by Ellis, Haven and Turkington [J] in the context of their general theory of nonconcave entropies and 
nonequivalent ensembles. These authors presented in Ref. [23| a first application of their theory for a statistical model 
of two-dimensional turbulence, whose entropy is nonconcave as a function of the energy and circulation. A model 
similar to the one treated here having a nonconcave entropy as a function of its energy and magnetization was also 
studied recently by Hahn and Kastner in Refs. l^, 26 1 . What we present here can be seen as a continuation and an 
extension of these papers. The difference with Refs. 12, 26] is that we present not one but two methods for calculating 



the entropy as a function of the energy and magnetization: one based on large deviation theory and a second based on 
Rugh's microcanonical formalism (27|, HH . Additionally, we discuss some of the relationships that exist between the 
nonconcavity of the entropy, the nonequivalence of the microcanonical and canonical ensembles, and the appearance 
of first-order phase transitions in the canonical ensemble. 



II. THE MODEL AND ITS THERMODYNAMIC DESCRIPTION 

The model that we study in this paper is the so-called mean-field </> 4 model, defined by the Hamiltonian 

N ' 2 1 1 \ 1 N 

Pi 1 2 . 1 4 \ 1 



C l 4*V AN 

In this expression, qi and pi are the canonical coordinates of unit mass particles moving on a line (qi,Pi £ K). These 
particles are subjected to a local double-well potential, in addition to interact with each other through a mean-field 
(infinite range) interaction given by the all-to-all coupling in the double sum. In the following, we shall use x to 
denote a point of the phase space T of the system, i.e., x — ({pi}, {%})• 

The mean-field 4 model was introduced by Desai and Zwanzig [29( who studied its relaxation to equilibrium using 
Langevin dynamics. More recently, Dauxois, Lepri and Ruffo [301 ] have studied this model in the canonical ensemble, 
and showed that it exhibits a second-order ferromagnetic phase transition. The critical temperature of the transition 
is found to be T c w 0.264, corresponding to a critical energy per particle or mean energy e c = T c /2 w 0.132. The 
steps leading to the calculation of the entropy of the mean-field </> 4 model augmented by an extra magnetic field were 
also presented recently by two of us in Ref. [31(. In the same paper, molecular dynamics simulations performed above 
the critical energy are reported in an attempt to study the convergence of finite-iV averages calculated in Rugh's 
microcanonical formalism. Finally, as mentioned in the introduction, Hahn and Kastner [l2l [261 ] have studied a mean- 
field </> 4 model similar to the one studied here. They have calculated for their model the thermodynamic entropy as 
a function of the mean energy e(x) = H(x)/N and the mean magnetization, defined as 

1 N 

™>( X ) = Z2 Qi- ( 2 ) 

1=1 



We recall that, in terms of these two quantities, the definition of the entropy is 

1 f 
lim — In / 

AT->oo N J r 



1 f 

s(e, m ) = lim — In / 5(e(x) — e)S(m(x) — m)dx. (3) 



Naturally, in terms of e alone, we have 



s(e) — lim — In / S (e(x) — e) dx. (4) 



r 



Our goal in the next sections is to calculate s(e, m) for the Hamiltonian defined in |T]). In doing so, we shall try 
to highlight the difficulties that arise in calculating this function due to the fact that it is nonconcave, in addition to 
discuss the physical consequences of having a nonconcave entropy in two thermodynamic variables. The fundamental 
difficulty, for what concerns calculations, is basically the following. If s(e, m) were concave, then we could calculate 
this function from the point of view of the canonical ensemble using the following steps: 
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(i) Calculate the partition function 

Z{fi,r,) = I e-^^'^^dx, (5) 



r 

where M(x) = Nm(x) is the total magnetization. 

(ii) Calculate the thermodynamic free energy function of the model defined by 

¥>(M = - Jim ^lnZ(A?y). (6) 

N—>oc iv 

(iii) Obtain s(e,m) by taking the Legendre transform of free energy function ip(fi,r]); in symbols, 

s(s,m) = fie + rjm — tp(/3,rj), (7) 

with fi and rj determined by the equations 

d d 

-ggViP' V)=^ g^ l P{0^)=m. (8) 

The problem with nonconcave entropies is that the last step does not actually yield the correct entropy function 
because Legendre transforms yield only concave functions. To circumvent this problem, we proceed in the next section 
to obtain s[£,to) using another method sug gested by large deviations jij. The method is the same as that used in 
Refs. [HI, 0, HH (see also Refs. 0, [HI ITlf). The results obtained will then be compared with those derived with 
Rugh's method. This second method is the subject of Sec. HVl The results obtained by the two methods are reported, 
compared and discussed in Sec. [V] 

Before we jump to the next section, it is useful to note that the partition function shown in ([5]) can be re-written 
in the more familiar form 

Z(fi,h) = [ e - p[H(x) ' hMi - x)] dx (9) 



Jr 

by defining rj = —fih. This shows that the partition function defined in ([5]) is nothing but the standard, canonical 
partition function of H with an added external magnetic field h. As is well known, h is the field of the canonical 
ensemble which is conjugated to the magnetization constraint m(x) — to of the microcanonical ensemble, while fi is 
the canonical field conjugated to the microcanonical energy constraint e{x) = e. These observations are important 
for what is coming later. 



III. LARGE DEVIATION CALCULATION OF THE ENTROPY 



The large deviation method described in this section is essentially a generalization of the maximum entropy principle, 
which is particularly suited for many-particle systems involving long-range or mean-field interactions. The method 
is presented in detail in Ref. and was used recently to calculate the entropy function of many models; see, e.g., 
Refs. SEini- 

The basic ingredients of the method are the following. First, one must be able to find a set of macro-variables or 
"mean fields" , denoted collectively by the vector fi(x), which are such that the energy per particle e{x) and the mean 
magnetization m{x) can be re-written as a function of these variables. In symbols, this means that there must exist 
two functions 1 e and to of /i such that 

e{x) — e(/i(a:)), m{x) — rh(fi(x)). (10) 

(See Ref. [1] or Q for a more complete and more accurate statement of this condition.) Second, one must be able to 
derive the expression of the entropy function s(/i) for the macrostate [i, which, in analogy with s(e) and s(e, m), is 
defined as 

s(fj,) = Jim iln / 6(fi(x) - n)dx. (11) 



1 Functions referring to the macrostate fi bear a tilde to distinguish them from those referring to e and m. 
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In large deviation theory, the function s(/i) is interpreted as the rate function (up to a sign) governing the fluctuations 
of [i with respect to the probability measure defining the microcanonical ensemble [4| . With this function, we finally 
obtain the entropy s(e, m) by solving a constrained maximization problem given by 

s(s,m) = sup s(/i). (12) 

fi-.e(^)— €,rh(f-i)= m 

This formula is the generalized maximum-entropy principle that we alluded to above. In large deviation theory, such 
a formula is referred to as a contraction formula or contraction principle (32l . |33| . 

The real challenge in solving the variational problem of (| 12[) is not so much to solve the constrained maximization, 
but to derive a priori the expression of s(fi). An implicit hope of the large deviation method, in this respect, is that 
s(fj) be a concave function of the selected macrostate fi. In this case, the calculation of s(/i) is facilitated by the fact 
that s(n) is the Legendre transform of some properly-defined free energy function. Specifically, define 

Z(X) = [ e~ NX -^dx (13) 



to be the partition function of the observable fJi(x). In this expression, A • fi(x) stands for the usual scalar product of 
the two vectors A and n(x). Now, let 

0(A) =- Hm -J-lnZ(A) (14) 

be the free energy function associated with fj,(x). Then, assuming that s{n) is concave, we have 

S(/i) = inf{A • /x - 0(A)}. (15) 

This equation is the macrostate generalization of the Legendre transform defined by Eqs. ([7]) and ([8]). It is now a 
valid equation for calculating s(/x) because the latter is assumed to be concave. 

All of the steps just described work for the mean-field (j) 4 model. To start, it is easy to see that a good choice of 
macrostate for this model is the vector \jl = (to, k, v) composed of the mean magnetization to, the mean kinetic energy 

1 N 

mT.pI (16) 



2N 

i=i 



and the mean potential energy 



1 N 

i=l 



In terms of /i, we indeed have 



e(f,) = k + v-—, (18) 



and since fi already includes the mean magnetization to, there is no need to define a function to. 

To calculate the entropy s(fi), we follow the Legendre transform path, anticipating that s(/x) is concave. Since each 
component of \x is additive, the partition function for /i has the form 

Z{\) = Z(X m , Afc, A„) = [Z k (X k )} N [Z m>v (X m ,X v )] N , (19) 

where 

Z k (X k )= r e' x ^/ 2 dp= x f^ (20) 

J-oo V *k 

and 

/>oo 

Z m , v {X m ,X v )= e-^-^-i 2 )/^. (21) 
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Note that in order for Zfc(Afc) to exist, Afc > 0; similarly, X v > above. From the expression of the partition function, 
we write the expression of the free energy function of /x as 

</?(A m , Afc, X v ) — </3fc(Afc) + &m,v{X m , X v ), (22) 

where 

^fc(Afe) = -InZk(Xk) = ^ In A fe - iln(27r), (23) 

and ifm tV (X m , X v ) = — lnZ m ^ v (X m ,X v ). In order to be able to express s(m,k,v) as the Legendre transform of 
if(X m , Afc, A„), we now have to verify that s(m,k,v) is concave. One way of verifying this is to verify that <fk{Xk) 
is a differentiable function of all its arguments. 2 This, as is easily verified, is indeed the case, so we can proceed to 
calculate the Legendre transform shown in Eq. (115|) ; the result is 

s(m, k, v) = Sfc(fc) + s(m 7 v), (24) 

where 

S k (k) = inf{Afefc - (^fc(Afc)} = ~ In fc + J ln(47re) (25) 

Afc Z Z 

and 

s m ,v(m,v) = inf {X m m + X v v - (p m , v (X m , X v )}. (26) 

A m eR,A„>0 

Since tp m ,v(X m , X v ) is differentiable, the last equation can actually be re-written as 

8 m ,v(m,v) = X m (m,v)m + X v (m,v)v - (p m v (X m (m,v),X v (m,v)), (27) 

where X m (m,v) and X v (m,v) are the unique solutions of the two equations 

d d 
——ip(X m , X v ) = m, — — <^(A m , X v ) = v. (28) 

We clearly see in these equations the familiar form of the Legendre transform; compare them with Eqs. ([7]) and ([H]). 

We have now reached the last step of the calculation of s(e,m), which is to solve the constrained maximization 
problem displayed in equation (|12|1 . In our case, this equation takes the form 

s(e,m) = sup {sfc(fc) + s m ^ v (m, v)}. (29) 

k^v.k+v— e+m 2 /4 

With the explicit expression of Sfc(fc) shown in (f2"5| . this can be re- written as 

s(e,m) = sup \-ln(e-v + ^-) + - ln(47re) + s mjV (m,v) \ . (30) 



Thus, in the end, to find s(e, m) we have to solve a one-dimensional, unconstrained maximization problem involving the 
two-dimensional function s m _ v (m, v). The calculation of s mjlI (m, v) calls itself for the solution of the two-dimensional 
minimization problem shown in Eq. (|26p. which involves the free energy (p m ,v{X m , X v ). Note that the mean potential 
energy v is constrained to lie in the range [—1/16, e + m 2 /4). The lower bound of this range arises naturally as the 
minimum of the double-well potential, whereas the upper bound arises because k = e — v + rn 2 /4 > 0. Finally, we 
note that the parity of the Hamiltonian {1} in the coordinates q implies that the function s(e, m) will be even in m. 



2 This result is at the root of the problem of ensemble equivalence. If the free energy is differentiable, then its Legendre transform yields 
the correct concave entropy (strictly concave, in fact). See Ref. [jj |25H for more information. 
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IV. RUGH'S FORMALISM AND MOLECULAR DYNAMICS SIMULATIONS 



The results of the large deviation method just outlined will be compared in the next section with results obtained 
from Rugh's microcanonical formalism [27], [28j . The idea behind this formalism is to perform molecular dynamics 
simulations of the Hamiltonian system H, and to compute the time average of certain observables along the system's 
trajectory. Assuming that the dynamics of the system is ergodic, one then equates the time average of these observables 
with their microcanonical ensemble averages. In this context, what Rugh's formalism provides is a general prescription 
for estimating important thermodynamic quantities of the microcanonical ensemble as time averages of suitably-chosen 
dynamical observables. 

In a previous paper [3l| , we have described the implementation of Rugh's formalism for the mean-field c/> 4 model 
augmented by a magnetic field. The main results of this paper, adapted to our specific model without the magnetic 
field, are the following. Let H(x, M) denote the Hamiltonian of a system whose magnetization is constrained to 
have the value M. How this constraint is to be put in the original Hamiltonian H(x) will be discussed below. The 
microcanonical entropy of the model as a function of the total energy E and total magnetization M is defined as 

S(E, M) = ln J s (H(x; M) - E) dx, (31) 

while the average of a general observable A(x) in the microcanonical ensemble is given by 



6 (H(x, M) - E) A(x)dx 

(A) EM = J -^j • (32) 

S(H(x; M) ~ E) dx 



r 

We consider now the total (extensive) entropy of the system rather than the (intensive) entropy density s(e, m) 
because we want to keep track of the ./V-dependence of the entropy. As usual, relevant thermodynamic quantities like 
the temperature, the specific heat and the magnetic susceptibility are defined through derivatives of the entropy In 
Rugh's formalism, these derivatives are calculated by choosing a vector Y in T such that Y ■ VH = 1. In terms of Y, 
we then have 



i E S(E,M ) = { W.Y Ku = ^ M] 



— S(E, M) = (V • Y) EM = (33) 



and 



dM K ' ; \ \dM 



S(E,M) = - V- — F . (34) 



E.AI 



The first derivative with respect to E defines, as always, the inverse temperature T(E, M) 1 of the system. The 
derivatives of {A) e,m are computed similarly as 

^ (A) EM = (V • (AY)) EM - (A) EM (35) 

and 

&^-(v.(^)> w + (v.(«,)> w ^ + (^- ™ 

Equations (|33|) - (|36p are valid for any number N of particles. As mentioned before, for ergodic systems, the ensemble 
averages (. . -)e,m entering in these equations are computed as time-averages obtained through molecular dynamics 
simulations in which the energy and the magnetization are both conserved. The convergence of these time-averages 
towards their thermodynamic (N — > oo) values is studied in Ref. [3lj ] - 

Molecular dynamics simulations performed with H(x) automatically conserve the energy, so it remains to adapt 
them to make sure that they also conserve the magnetization, i.e., that Nm(x) — M at all time. One way to achieve 
this is to explicitly incorporate the magnetization constraint into the Hamiltonian H(x) by eliminating, for example, 
the variable qpj using qjy = M — X^i. 1 Qi- ^ n this way, we obtain a new constrained Hamiltonian H c (x, M) involving 
N — 1 particles and the magnetization M: 

N-l l,N-l N-l , 

\ ^ 2 *■ \ " i \ " / 2 i 4 



i—1 i,j i—1 




7 



Note that the the kinetic energy is no more diagonal because the magnetization constraint couples to the momenta. 

For this Hamiltonian, we follow Rugh 27], and choose the vector Y to have nonvanishing components only in 
correspondence with the kinetic energy. That is, we choose 

Y= 2^(Pi."-»Pw-i,0,...,0), (38) 

where K c is the kinetic part of H c . It is easy to verify that Y ■ VH C = 1. Consequently, we can use Eq. (J33J) to obtain 
the temperature T(E,M); the result is 



T(E, M) \ 2K, 
Similarly, we obtain from (f34|) . 



N - 3 N 

EM 



(39) 



dM v ' ' T{E,M) \ 2K, 
with T(E, M) given by ([39]) and m 3 defined by 



S(E,M) = — ^— - / m 3 ^— - ) (40 ) 



EM 



N 

1 

m 3 



(«) 



N 

i=l 



By defining an effective magnetic field h(E,M) with the usual thermodynamic relation 
we can also put (|40[) in the form 



h(E,M) = -T(E,M)-^S(E,M), (42) 



iV - 3 s 

h(E,M) = -m + T(E,M)(m 3 ——) . (43) 



2K r 



E.M 



Further derivatives of this quantity and of T(E,M), as calculated with Eqs. (|35|) and (136 p . yield the magnetic 
susceptibility and the heat capacity, respectively. 

At this point, it is important to note that T(E, M) and h{E, M) are microcanonical quantities — they are functions 
of the constrained values of the energy and magnetization. These quantities must be distinguish from their canonical 
counterparts, T = (fc_B/3) -1 and h, which are not functions of any other variables — they are the parameters of the 
canonical ensemble. The equivalence of these two ensembles will be explored in the next section. 

For now, we finish this section by discussing another method which can be used to implement the magnetization 
constraint into the molecular dynamics simulations. The method relies on the use of Lagrange multipliers, and works 
by simulating the dynamics of the Hamiltonian 

H V = H + vM, (44) 

which is our original Hamiltonian augmented by the magnetization M and its associated Lagrange multiplier v. The 
equations of motion for H v read 

dH u 8V dM dv . 

Qi = — ^— = --;. v—- = -- v, i = l,...,N. (45) 

dqi dqt dqi dqi 

In this expression, V denotes the potential energy, i.e., V = H — K . An expression for v in terms of the qfs is readily 
obtained from (|4"5"]) by noting that M is constant in time, so that YliLi & = 0. As a result, 

1 N dV 1 N 1 N 

- = -^E^--^E*-^E? 3 =™--3, (46) 

i=l H i=l i=l 

where, for the last equalities, we have used the explicit expression of the potential energy of the mean-field 4 model. 
Inserting this back into Eq. (j45|) leads us, finally, to the following equations of motion 3 : 

'ii = \<li -4 - 7^m + m 3 , 2 = 1,..., N. (47) 



These equations could also be obtained from H c by re-inserting the coordinate into the canonical equations of motion. 



8 



The main advantage of considering H v for performing the molecular dynamics simulations is that it allows a direct 
calculation of the effective magnetic field h(E, M). Indeed, in view of the equations of motion derived from H v and 
the form of i?„, it is natural to interpret the Lagrange multiplier v as the opposite of a fictitious magnetic field which 
adjusts itself in time in order to keep the magnetization constant. The time average of this instantaneous field defines 
another effective magnetic field, denoted by h(E, M), whose explicit expression is 



h(E,M) = -{v) B ,M 



(m 3 ) 



E.M 



(48) 



Now, although this expression is different from the expression of the field h(E, M) obtained in Rugh's formalism, we 
have observed in our simulations that the difference between the two fields is negligible for N large, so that h(E, M) 
and h(E, M) can be considered to be equal for all practical purposes. It can be noted, in fact, that Eq. (|43|) reduces 
to Eq. (138]) if 



m 3 - 



vV-3 
2X7 



(m 3 ) 



E.M 



E.M 



JV-3 
2K, 



(49) 



E.M 



Thus, if 7773 becomes statistically uncorrelated with the kinetic observable, which is something expected to occur in 
the thermodynamic limit, then h(E,M) = h{E,M). 

The calculation of the magnetic susceptibility x{E, M) at constant energy and magnetization, defined by 



1 d 
— = N— — h(E, M), 



(50) 



is also simplified by considering h(E,M) instead of h(E,M). The magnetic susceptibility is calculated in Rugh's 
formalism from Eq. When applied to h(E, M), this equation yields 



1 



X(E,M) 



(iV-3) 



7713(771 — 777,3) 
2K C 



(m 3 ) 



E.M 



E.M 



where, in analogy with 7773, we have introduced the notation 



m 2 



1 N 



(m — 7773) 

2K C 



E.M 



l + 3(m 2 > 



E.M ■ 



(51) 



(52) 



We leave it to the reader to check that the expression of x{E, M) obtained from h(E, M) is more complicated. In the 
end, it is interesting to note that we could have calculated \{E,M) by taking the numerical derivative of h(E,M) 
or h(E,M). This, however, generally has the effect of amplifying the error associated with either field. In Rugh's 
formalism, \(E,M) is calculated as a time-average just like h(E,M) or h(E,M), and so carries a numerical error 
comparable to the error associated with these fields. 



V. COMPARISON AND DISCUSSION OF THE RESULTS 



The entropy density s(e, m) of the mean-field (f> 4 model as a function of its mean energy e and mean magnetization 
777 is shown in Fig. [1] In Fig. [TJa), we have plotted the graph of s(e, m) as a function of 777 for four different values 
of e: one above the critical value e c w 0.132 and three below. To get a better idea of the shape of s(e,m) as a 
two-dimensional function, we also report in Fig. [Tfb) the curves of Fig. QJa) stacked along the e direction. The data 
shown in both of these plots are those obtained via the large deviation method, i.e., via the maximization problem 
displayed in (|30p . It should be noted that this maximization problem cannot be solved analytically because the free 
energy ^ m .„(A m , A„), which intervenes in the derivation of the macrostate entropy function s TO)W (m, v), involves a 
quartic integral; see Eq. pTjl . Nevertheless, this integral, like all the other quantities and optimization problems 
involved in the calculation of s(e,m), can easily be handled numerically with the end result that s(e, m) can easily 
be computed numerically to any desired precision. The specific results that we present in Fig. [1] were obtained with 
Mathematica using the default 16-digit working precision. The resulting numerical error is smaller than the tickness 
of the curves reported in Fig. [T] 

The comparison of the large deviation method with Rugh's method is reported in Figs. [2] and [3J Figure [2] shows 
the results of the calculation of the effective magnetic field h(e,m) using the large deviation method (full line) and 




FIG. 1: (a) Entropy as a function of the mean magnetization m for different values of the mean energy e. (b) 3D view of 
s(e,m). The black dots show the location of the equilibrium values of m in the microcanonical ensemble for each value of e 
(see the text for explanation). 
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FIG. 2: Effective magnetic field calculated using Rugh's method (•) and the large deviation method (full line). The values of e 
are from top to bottom e = 0.16, 0.08, 0.04, and —0.04. In this figure and in the next one the error associated with the results 
of the molecular dynamics simulations is smaller than the filled dots. 



Rugh's method (filled dots). Because s(e, m) is an even function of m, we show h{e, m) only for m > 0. In the former 
method, h(e, m) is calculated from its thermodynamic-limit definition 

h(e,m) = -T(e,m)^^-, (53) 

using the form of s(e, m) shown in Fig.[T] In the latter method, the same field is calculated using Eq. (|43[) or Eq. (|48[) . 
As mentioned earlier, these two equations are expected to yield comparable results so long as the molecular dynamics 
simulations are performed for large-enough systems. In our case, N — 10 2 particles were used, and we found no 
appreciable differences between the fields obtained with Eqs. (|43|) and (|48[) . From Fig.[2j we see that there is a perfect 
agreement between the field h(s, m) obtained from the large deviation approach and the same field obtained from 
Rugh's approach. We thus conclude that the two approaches agree on the calculation of s(e, m), up to a constant which 
can easily be determined. The two approaches perfectly agree also for the calculation of the magnetic susceptibility 
at constant energy; see Fig. [3J 

Having shown that the large deviation approach for calculating s(e, m) gives the same results as those obtained with 
Rugh's approach, we now come to the discussion of the nonequivalence of the microcanonical and canonical ensembles 
in relation to the nonconcavity of s(e, m). 4 A first hint to the effect that the two ensembles are nonequivalent is 



4 By microcanonical ensemble, we mean of course the ensemble in which both the energy and magnetization are fixed. The concommittant 
canonical ensemble is the one in which f3 and h are fixed. 
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FIG. 3: Magnetic susceptibility calculated using Rugh's method (•) and the large deviation method (full line). 



2 
1 

ds{e,m) Q 
dm 

-1 
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-1 -0.5 0.5 1 

m 

FIG. 4: m-derivative of the entropy function shown in Fig. Ola). 

provided by the fact that the effective field h(e, m) can be negative for positive values of m when e < e c , as can be 
seen from Fig. [2j In the canonical ensemble, it can be shown that the equilibrium magnetization of the mean-field 
</> 4 model can only be negative when the canonically-imposed field h is negative, and can only be positive when h is 
positive. 5 In other words, in the canonical ensemble, the magnetization of the (f> 4 model is always in the direction of 
the field. In the microcanonical ensemble, we see that this is not always the case: h(e, m) and m can have opposite 
signs, so that this ensemble must be nonequivalent with the canonical ensemble. 

To understand how this peculiarity of the cf> 4 model arises out of the nonconcavity of s(e,m), we only have to 
analyze the definition of h(e,m), given by Eq. (|53[) . There T(e,m) is always positive, since it is directly proportional 
to the kinetic energy, so that the sign of h(e,m) is always the opposite of the sign of ds(e,m)/dm. Therefore, in 
order for h(£,m) to be negative for m > 0, we must have ds(e,m) / dm > for m > 0. This, as can be seen by 
comparing Figs, [lja), [2] and IH can only happen if s(e,m) is nonconcave, i.e., if the graph of s(e, m) has a bimodal 
or "double-bump" shape as a function of the magnetization. 6 When s(e,m) is concave or unimodal, as is the case 
for e = 0.16, then ds(e,m)/dm is necessarily negative when m > (Fig. 2]), which implies that h(e,m) is necessarily 
positive when m > (Fig. [2]), in agreement with what is seen in the canonical ensemble. 

Another consequence of the nonconcavity of s(e,m) is that the magnetic susceptibility x(e,m), calculated micro- 
canonically at fixed values of e and m, can be negative for certain values of m and e; see Fig. [3] This, alone, is a sure 
sign that the microcanonical and canonical ensembles are nonequivalent, for we know that the magnetic susceptibility 
is always positive in the canonical ensemble. Indeed, we know that by increasing the magnetic field h while keeping the 
inverse temperature (3 constant, the equilibrium magnetization m(/3, h) of the canonical ensemble can only increase, 






/ \ e=-0.04 




£ = 0.16^^ 



For generic Hamiltonians that are not even in q, the magnetization and magnetic field could have opposite signs. What we report here 
is specific to the mean-field ifi 4 model, which has the parity in q. 

A similar argument can be used to show that h(e, m) can be positive for m < if and only if s(e, m) is nonconcave in m. 
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which means that 



dm((3, h) 
dh 



> 0. 



(54) 



(See the Appendix for a general proof of this result.) In the microcanonical ensemble, the role of h and m are reversed: 
what is varied is m not h, and the proper magnetic susceptibility to consider in this ensemble, defined as 



X(e, m) = 



dh(s, m) 
dm 



d 2 s ds 
dmde dm 



ds d 2 s \ 
de dm 2 J 



(55) 



can be negative, as demonstrated in Fig. [3j Physically, this means that, in the microcanonical ensemble, an increase 
in m can lead to a decrease of the microcanonical field h(e, m). This, as we have already mentioned, is related to the 
fact that s(e,m) is nonconcave as a function of m; however, it is important to note that x( £ > m ) does not become 
negative exactly when s(e,m) becomes nonconcave. Indeed, the local convexity properties of s(e,m) are determined 
by the eigenvalues of the Hessian matrix 



d 2 s/dm 2 d 2 s/dmde 
d 2 s/dedm d 2 s/de 2 



whose characteristic equation is 



d 2 s 
dm 2 



dm 2 de 2 



dmde 



= 



(56) 



(57) 



The sign of the eigenvalues of the Hessian changes when either the sign of the first-order term or of the zeroth-order 
term of the characteristic equation changes. This occurs on two lines in the (e, m) plane, which can either intersect 
at one point on the line where the susceptibility given in Eq. ([55)) changes sign, or not intersect at all. In either case, 
the change in the local convexity properties of s(e,m) and in the sign of xi, £ i m ) does not happen generically at the 
same value of m for a given value of e. This is also seen by comparing Figs. [3]and[3J 

To close our discussion of the mean-field </> 4 model, we now comment on the thermodynamic properties of this 
model obtained from the microcanonical ensemble in which only the energy is fixed. In this ensemble, the relevant 
entropy function to consider is the function s(e) defined in Eq. ([J). From the knowledge of s(e, m), we can derive s(e) 
by using the following contraction formula 0, [§] : 



s(e) = sups(e, m) 



(58) 



Figure [TJb) shows the positions of the maxima of s(e,m), which represent physically the equilibrium values of the 
magnetization in the microcanonical ensemble with fixed energy Ne. We see from this figure that s(e, m) has only one 
maximum located at m = when it is concave as a function of m. This happens when e > e c , so that s(e) = s(e, m = 0) 
for all e > e c . Below e c , the single maximum of s(e, m) splits in a continuous way into two symmetric maxima. The 
continuous character of the splitting can be deduced from the fact that the phase transition of the model, seen in the 
canonical ensemble having /3 as its only parameter, is known to be of second order [3fj| . This implies that the phase 
transition seen in the microcanonical ensemble as a function of e is also second-order, i.e., continuous. This must be 
so because a second-order phase transition in the canonical ensemble implies that the canonical and microcanonical 
ensembles are fully equivalent, since s(e) in this case is concave and has no affine parts, i.e., no straight (linear) lines 
in its graph [j| Q, [2j| [Hj]. These two features of s(e) are somewhat difficult to see from Fig. Q] however, they are 
confirmed by the theory of nonconcave entropies. The fact indeed is that if s(e) was nonconcave as a function of e, 
then the phase transition of the model would be first-order as a function of the temperature rather than second-order 
(25l l34j. The same applies if s(e) was affine: in this case, the phase transition in the canonical ensemble would also 
be first-order [2^, [34| , which is not what is reported by Dauxois et al. [3(| ■ 

The relationship between nonconcave entropies and first-order phase transitions is fully general, and can be applied 
to s(e,m) to conclude that the mean-field <fi model displays, in the canonical ensemble with temperature T and 
magnetic field h, a first-order, field-driven phase transition when T < T c = 2e c . The phase transition, in this case, 
is driven by the magnetic field h because the entropy s(e, m) is nonconcave as a function of the magnetization, the 
variable conjugated to h. Moreover, since h is basically the derivative of s(e, m), it should be clear from the symmetric, 
bimodal shape of s(e, m), seen again when e < e c , that the critical field of the phase transition is h c = 0. Physically, 
this means that if we set the temperature T to be below T c , then the equilibrium magnetization m(T, h) of the 
canonical ensemble jumps from a negative, non-zero value to a positive, non-zero value as h varies from 0~ to + . 
The "spontaneous" values of m(T, h) found for h = 0~ and h = + are nothing but the two values of m at which 
s(s, m) if maximum when e < e c . 
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VI. CONCLUSION 



The entropy of long-range systems can be concave as a function of the energy alone, and yet be nonconcave as 
a function of the energy and other invariant quantities. This was illustrated here for the mean-held 4 model: its 
entropy is concave as a function of the energy, but is nonconcave as a function of the energy and magnetization. This 
property of the entropy leads, as we have shown, to a fundamental difference between the thermodynamic properties 
of the model found in the microcanonical ensemble as a function of the energy and magnetization, and those found 
in the canonical ensemble as a function of the temperature and magnetic field. On the one hand, we have shown that 
the (effective) magnetic field of the microcanonical ensemble can have a sign opposite to that of the magnetization. 
In the canonical ensemble, this never happens, as the equilibrium magnetization in this ensemble is always in the 
direction of the applied magnetic field. On the other hand, we showed that the magnetic susceptibility at constant 
temperature, calculated microcanonically as a function of the energy and magnetization, can be negative, whereas it 
is always positive in the canonical ensemble. Because of these two differences, we say that the microcanonical and 
canonical ensembles are nonequivalent. 

There is a further consequence of the nonconcavity of the entropy that we have discussed, namely that the mean- 
field 4> 4 model displays a first-order phase transition driven by the magnetic field in the canonical ensemble. This 
phase transition is directly related to the fact that the entropy of the model is nonconcave as a function of the 
magnetization — the quantity conjugated to the magnetic field — for certain values of the energy. Thus, in addition to 
display a second-order phase transition driven by the temperature (or the energy), the mean- field <p A model displays 
a first-order phase transition driven by the magnetic field. The situation, as such, is similar to the two-dimensional 
Ising model, which also displays a second-order phase transition in temperature but a first-order phase transition in 
the magnetic field. There is, however, a fundamental difference between the Ising model and the mean-field <^> 4 model. 
Being a short-range interaction model, the Ising model has an entropy which is necessarily concave as a function of 
its energy and magnetization [35j . The first-order, field-driven phase transition of this model is thus not related to a 
nonconcavity of the entropy; it appears, in fact, because the entropy, when plotted as a function of the magnetization, 
has affine parts in the form of "plateaus." For more details on this point, we refer the reader to Sec. 11 of Ref. [36| 
and the references cited therein. 

Our last words of this paper will go to a technical remark about a possible way to compute the entropy s(s,m) 
using Legendre transforms. We mentioned in Sec.lITlthat because s(e, m) is nonconcave, it cannot be calculated as the 
Legendre transform of the energy function 77) or, equivalently, (p(f3,h), which is the free energy associated with 
the partition function shown in However, there is a subtle workaround to this problem. Indeed, although s(e, m) 
is nonconcave as a function of m, it is concave in e for all m. This suggests that the projections of s(e,m) along e 
can be calculated by taking the Legendre transform of a free energy function, which, unconventionally, is a function 
of the inverse temperature and the magnetization. The construction of this Legendre transform, which follow from 
the theory of Ellis et al. [1| , is outlined in Appendix [Bl 
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APPENDIX A: PROOF OF THE POSITIVITY OF X (P, h) 

The equilibrium magnetization m(/3, h), calculated in the canonical ensemble as a function of the inverse temperature 
(3 and magnetic field h, can be deduced from the equation 

where ip(f3,h) is the canonical free energy function associated with the partition function shown in ©. With this 
equation, the definition of the magnetic susceptibility h) in the canonical ensemble can be rewritten as 
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At this point, we note that </?(/?, h) is an always convex function of (3 and h, so that 

&<p(J3,h) 



dh? 

for all f3 and h. Consequently, h) > for all values of (3 > and ft, as claimed. 



< (A3) 



APPENDIX B: MIXED CANONICAL-MICROCANONICAL ENSEMBLE 



Define the partition function 

Z<fi,m)= I e- (}H ^5{m{x)-m)dx (Bl) 

and its associated free energy 



r 



(p((3,m) = - lim ^-lnZ(/3,m). (B2) 

Then 

s(e, m) = (3e— ip(/3, to), (B3) 

where the value of (3 is set by solving 

d<p{(3, to) 



d(3 



e. (B4) 



The last two equations define the Legendre transform that takes ip([3, to) to s(e, m). It is a valid Legendre transform 
for the mean-field </> 4 model because s(e, to) is concave as a function of e for all values of m for this model; see Fig. [T] 
Following Ellis et al. 0], Z(f3,m) and Lp((3,m) are to be interpreted, respectively, as the partition function and free 
energy function of a mixed canonical-microcanonical ensemble in which the energy is treated in a canonical way, while 
the magnetization is treated in a microcanonical way. The reader is referred to Ref. [4| for more details on the idea 
of mixed ensembles. 
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